AITopics | cut layer

Collaborating Authors

cut layer

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

bb63841e1ad12370a34504f15c60db4f-Paper-Conference.pdf

Neural Information Processing SystemsFeb-17-2026, 19:21:32 GMT

artificial intelligence, machine learning, participation, (16 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
North America > United States > Virginia (0.04)
North America > United States > California > Yolo County > Davis (0.04)
Asia > Nepal (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (0.67)
Social Sector (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Data Heterogeneity and Forgotten Labels in Split Federated Learning

Tirana, Joana, Tsigkari, Dimitra, Noguero, David Solans, Kourtellis, Nicolas

arXiv.org Artificial IntelligenceNov-14-2025

In Split Federated Learning (SFL), the clients collaboratively train a model with the help of a server by splitting the model into two parts. Part-1 is trained locally at each client and aggregated by the aggregator at the end of each round. Part-2 is trained at a server that sequentially processes the intermediate activations received from each client. We study the phenomenon of catastrophic forgetting (CF) in SFL in the presence of data heterogeneity. In detail, due to the nature of SFL, local updates of part-1 may drift away from global optima, while part-2 is sensitive to the processing sequence, similar to forgetting in continual learning (CL). Specifically, we observe that the trained model performs better in classes (labels) seen at the end of the sequence. We investigate this phenomenon with emphasis on key aspects of SFL, such as the processing order at the server and the cut layer. Based on our findings, we propose Hydra, a novel mitigation method inspired by multi-head neural networks and adapted for the SFL's setting. Extensive numerical evaluations show that Hydra outperforms baselines and methods from the literature.

accuracy, artificial intelligence, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2511.09736

Country: Europe (0.93)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.93)

Add feedback

Convergence Analysis of Split Federated Learning on Heterogeneous Data

Neural Information Processing SystemsOct-10-2025, 14:59:08 GMT

However, FL is usually computationally intensive.

participation, sfl-v1, sfl-v2, (13 more...)

Neural Information Processing Systems

Country:

Asia > China (0.04)
North America > United States > Virginia (0.04)
North America > United States > California > Yolo County > Davis (0.04)
Asia > Nepal (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)

Industry:

Health & Medicine (0.67)
Social Sector (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.67)

Add feedback

Pigeon-SL: Robust Split Learning Framework for Edge Intelligence under Malicious Clients

Park, Sangjun, Quek, Tony Q. S., Seo, Hyowoon

arXiv.org Artificial IntelligenceAug-5-2025

This work has been submitted to the IEEE for possible publication. Abstract --Recent advances in split learning (SL) have established it as a promising framework for privacy-preserving, communication-efficient distributed learning at the network edge. However, SL's sequential update process is vulnerable to even a single malicious client, which can significantly degrade model accuracy. T o address this, we introduce Pigeon-SL, a novel scheme grounded in the pigeonhole principle that guarantees at least one entirely honest cluster among M clients, even when up to N of them are adversarial. In each global round, the access point partitions the clients into N + 1 clusters, trains each cluster independently via vanilla SL, and evaluates their validation losses on a shared dataset. We further enhance training and communication efficiency with Pigeon-SL+, which repeats training on the selected cluster to match the update throughput of standard SL.

artificial intelligence, machine learning, malicious client, (17 more...)

arXiv.org Artificial Intelligence

2508.02235

Country:

North America > United States (0.28)
Asia > South Korea (0.28)

Genre: Research Report (0.50)

Industry: Information Technology > Security & Privacy (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Security & Privacy (0.68)
Information Technology > Communications (0.67)

Add feedback

Collaborative Split Federated Learning with Parallel Training and Aggregation

Papageorgiou, Yiannis, Thomas, Yannis, Filippakopoulos, Alexios, Khalili, Ramin, Koutsopoulos, Iordanis

arXiv.org Artificial IntelligenceApr-23-2025

Federated learning (FL) operates based on model exchanges between the server and the clients, and it suffers from significant client-side computation and communication burden. Split federated learning (SFL) arises a promising solution by splitting the model into two parts, that are trained sequentially: the clients train the first part of the model (client-side model) and transmit it to the server that trains the second (server-side model). Existing SFL schemes though still exhibit long training delays and significant communication overhead, especially when clients of different computing capability participate. Thus, we propose Collaborative-Split Federated Learning(C-SFL), a novel scheme that splits the model into three parts, namely the model parts trained at the computationally weak clients, the ones trained at the computationally strong clients, and the ones at the server. Unlike existing works, C-SFL enables parallel training and aggregation of model's parts at the clients and at the server, resulting in reduced training delays and commmunication overhead while improving the model's accuracy. Experiments verify the multiple gains of C-SFL against the existing schemes. Keywords: Split learning Distributed AI systems 1 Introduction The wide deployment of devices that gather vast amounts of data, along with the stringent data privacy requirements of numerous applications (such as healthcare and natural language processing applications), drives the adoption of distributed learning schemes, like Federated Learning (FL) [9]. In FL, clients update their local model for multiple local epochs and send it to a server, which aggregates all local models and transmits the aggregated model back to the clients for the next training round. This synchronous process is repeated for multiple rounds.

artificial intelligence, communication overhead, machine learning, (14 more...)

arXiv.org Artificial Intelligence

2504.15724

Country: Europe > Germany (0.28)

Genre: Research Report (0.84)

Industry: Information Technology > Security & Privacy (0.88)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Energy-Efficient Split Learning for Fine-Tuning Large Language Models in Edge Networks

Li, Zuguang, Wu, Shaohua, Li, Liang, Zhang, Songge

arXiv.org Artificial IntelligenceJan-13-2025

In this letter, we propose an energy-efficient split learning (SL) framework for fine-tuning large language models (LLMs) using geo-distributed personal data at the network edge, where LLMs are split and alternately across massive mobile devices and an edge server. Considering the device heterogeneity and channel dynamics in edge networks, a \underline{C}ut l\underline{A}yer and computing \underline{R}esource \underline{D}ecision (CARD) algorithm is developed to minimize training delay and energy consumption. Simulation results demonstrate that the proposed approach reduces the average training delay and server's energy consumption by 70.8% and 53.1%, compared to the benchmarks, respectively.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2412.0009

Country: Asia > China (0.30)

Genre: Research Report > New Finding (0.48)

Industry:

Energy (0.90)
Information Technology > Security & Privacy (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Add feedback

Just a Simple Transformation is Enough for Data Protection in Vertical Federated Learning

Semenov, Andrei, Zmushko, Philip, Pichugin, Alexander, Beznosikov, Aleksandr

arXiv.org Artificial IntelligenceDec-16-2024

Vertical Federated Learning (VFL) aims to enable collaborative training of deep learning models while maintaining privacy protection. However, the VFL procedure still has components that are vulnerable to attacks by malicious parties. In our work, we consider feature reconstruction attacks, a common risk targeting input data compromise. We theoretically claim that feature reconstruction attacks cannot succeed without knowledge of the prior distribution on data. Consequently, we demonstrate that even simple model architecture transformations can significantly impact the protection of input data during VFL. Confirming these findings with experimental results, we show that MLP-based models are resistant to state-of-the-art feature reconstruction attacks.

artificial intelligence, deep learning, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2412.11689

Country:

North America > United States > Virginia (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
North America > United States > California > Orange County > Anaheim (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Split Federated Learning Over Heterogeneous Edge Devices: Algorithm and Optimization

Sun, Yunrui, Hu, Gang, Teng, Yinglei, Cai, Dunbo

arXiv.org Artificial IntelligenceNov-21-2024

Split Learning (SL) is a promising collaborative machine learning approach, enabling resource-constrained devices to train models without sharing raw data, while reducing computational load and preserving privacy simultaneously. However, current SL algorithms face limitations in training efficiency and suffer from prolonged latency, particularly in sequential settings, where the slowest device can bottleneck the entire process due to heterogeneous resources and frequent data exchanges between clients and servers. To address these challenges, we propose the Heterogeneous Split Federated Learning (HSFL) framework, which allows resource-constrained clients to train their personalized client-side models in parallel, utilizing different cut layers. Aiming to mitigate the impact of heterogeneous environments and accelerate the training process, we formulate a latency minimization problem that optimizes computational and transmission resources jointly. Additionally, we design a resource allocation algorithm that combines the Sample Average Approximation (SAA), Genetic Algorithm (GA), Lagrangian relaxation and Branch and Bound (B\&B) methods to efficiently solve this problem. Simulation results demonstrate that HSFL outperforms other frameworks in terms of both convergence rate and model accuracy on heterogeneous devices with non-iid data, while the optimization algorithm is better than other baseline methods in reducing latency.

cut layer, latency, training latency, (14 more...)

arXiv.org Artificial Intelligence

2411.13907

Country:

Asia > China > Beijing > Beijing (0.05)
Europe > Spain > Catalonia > Barcelona Province > Barcelona (0.04)

Genre: Research Report > New Finding (0.34)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Federated Split Learning for Human Activity Recognition with Differential Privacy

Ndeko, Josue, Shaon, Shaba, Beal, Aubrey, Sahoo, Avimanyu, Nguyen, Dinh C.

arXiv.org Artificial IntelligenceNov-9-2024

This paper proposes a novel intelligent human activity recognition (HAR) framework based on a new design of Federated Split Learning (FSL) with Differential Privacy (DP) over edge networks. Our FSL-DP framework leverages both accelerometer and gyroscope data, achieving significant improvements in HAR accuracy. The evaluation includes a detailed comparison between traditional Federated Learning (FL) and our FSL framework, showing that the FSL framework outperforms FL models in both accuracy and loss metrics. Additionally, we examine the privacy-performance trade-off under different data settings in the DP mechanism, highlighting the balance between privacy guarantees and model accuracy. The results also indicate that our FSL framework achieves faster communication times per training round compared to traditional FL, further emphasizing its efficiency and effectiveness. This work provides valuable insight and a novel framework which was tested on a real-life dataset.

activity recognition, artificial intelligence, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2411.06263

Country:

North America > United States > Alabama (0.04)
North America > Canada > Newfoundland and Labrador > Labrador (0.04)

Genre: Research Report (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Information Technology > Smart Houses & Appliances (0.68)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Internet of Things (1.00)
Information Technology > Data Science (1.00)
(2 more...)

Add feedback

Semantic Meta-Split Learning: A TinyML Scheme for Few-Shot Wireless Image Classification

Eldeeb, Eslam, Shehab, Mohammad, Alves, Hirley, Alouini, Mohamed-Slim

arXiv.org Artificial IntelligenceSep-3-2024

Semantic and goal-oriented (SGO) communication is an emerging technology that only transmits significant information for a given task. Semantic communication encounters many challenges, such as computational complexity at end users, availability of data, and privacy-preserving. This work presents a TinyML-based semantic communication framework for few-shot wireless image classification that integrates split-learning and meta-learning. We exploit split-learning to limit the computations performed by the end-users while ensuring privacy-preserving. In addition, meta-learning overcomes data availability concerns and speeds up training by utilizing similarly trained tasks. The proposed algorithm is tested using a data set of images of hand-written letters. In addition, we present an uncertainty analysis of the predictions using conformal prediction (CP) techniques. Simulation results show that the proposed Semantic-MSL outperforms conventional schemes by achieving 20 % gain on classification accuracy using fewer data points, yet less training energy consumption.

accuracy, communication, prediction, (14 more...)

arXiv.org Artificial Intelligence

2409.12978

Country:

Europe > Finland > Northern Ostrobothnia > Oulu (0.04)
Asia > Nepal (0.04)
Asia > Middle East > Saudi Arabia (0.04)
Asia > Middle East > Jordan (0.04)

Genre: Research Report (0.84)

Industry: Information Technology > Security & Privacy (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.68)

Add feedback